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ACCESSING SHORTER-DURATION INSTANCES OF 
ACTIVATABLE OBJECTS BASED ON OBJECT REFERENCES STORED 
IN LONGER-DURATION MEMORY 
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OBJECT REFERENCES STORED IN LONGER-DURATION MEMORY, filed on 
February 25, 2000 by Harlan Sexton et al. (docket 50277-332; OID-1999-084-01PRO). 
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SYSTEM WITHIN A RUN-TIME ENVIRONMENT, filed on February 1 1, 1999 by Harlan 
Sexton et al. (docket 50277-178; OID-1997-048-01); 

U.S. Patent Application Serial No. 09/248,291 entitled MACHINE INDEPENDENT 
MEMORY MANAGEMENT SYSTEM WITHIN A RUN-TIME ENVIRONMENT, filed on 
February 11, 1999 by Harlan Sexton etal. (docket 5Q111-\11; OID- 1997-048-02); 

U.S. Patent Application Serial No. 09/248,294 entitled ADDRESS CALCULATION 
OF INVARL\NT REFERENCES WITHIN A RUN-TIME ENVIRONMENT, filed on 
February 11, 1999 by Harlan Sexton et al. (docket 50277-179; OID- 1997-048-03); 
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U.S. Patent Application Serial No. 09/248,297 entitled PAGED MEMORY 
MANAGEMENT SYSTEM WITHIN A RUN-TIME ENVIRONMENT, filed on February 
11, 1999 by Harlan Sexton a/, (docket 50277-261; OID-1997-048-17); 

U.S. Patent Application Serial No. 09/320,578 entitled METHOD AND ARTICLE 
5 FOR ACCESSING SLOTS OF PAGED OBJECTS, filed on May 27, 1 999 by Harlan Sexton 
et al. (docket 50277-293; OID-1998-034-01); 

U.S. Patent Application Serial No. 09/320,578 entitled METHOD AND ARTICLE 
FOR MANAGING REFERENCES TO EXTERNAL OBJECTS IN A RUNTIME 
ENVIRONMENT, filed on May 27, 1999 by Harlan Sexton et al. (docket 50277-293; OID- 
10 1998-034-01); 

U.S. Patent Application Serial No. 09/512,619 entitled METHOD FOR MANAGING 
MEMORY USING EXPLICIT, LAZY INITALIZATION IN A RUN-TIME 
ENVIRONMENT, filed on February 25, 2000 by Harlan Sexton et al. (docket 50277-214; 
OlD-1997-048-07); 

1 5 U.S. Patent Application Serial No. 09/5 12,622 entitled METHOD FOR MANAGING 

MEMORY USING ACTIVATION-DRIVEN INITIALIZATION IN A RUN-TIME 

ENVIRONMENT, filed on February 25, 2000 by Harlan Sexton et al. (docket 50277-213; 

OID- 1997-048-08); 

U.S. Patent Application Serial No. 09/512,621 entitled SYSTEM AND 
20 METHODLOGY FOR SUPPORTING A PLATFORM INDEPENDENT OBJECT 

FORMAT FOR A RUN-TME ENVIRONMENT, filed on February 25, 2000 by Harlan 

Sexton etal. (docket 50277-258; OID-1997-048-14); 

U.S. Patent Application Serial No. 09/512,618 entitled METHOD AND 

APPARATUS FOR MANAGING SHARED MEMORY IN A RUN-TME 
25 ENVIRONMENT, filed on February 25, 2000 by Harian Sexton et al. (docket 50277-259; 

OID- 1997-048- 15); and 
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U.S. Patent Application Serial No. 09/512,620 entitled USING A VIRTUAL 
MACHINE INSTANCE AS THE BASIC UNIT OF USER EXECUTION IN A SERVER 
ENVIRONMENT, filed on February 25, 2000 by Harlan Sexton et al (docket 50277-403; 
OID-1997-048-19). 

5 FIELD OF THE INVENTION 

The present invention relates to handling references to objects and, more specifically, 
to accessing shorter-duration instances of activatable objects based on object references 
stored in longer-duration memory. 

BACKGROUND OF THE INVENTION 

10 Computer languages that support independent constructed program units must resolve 

symbolic references from one program unit to an "extemal" object in another program unit. 
For example, most C and C++ programming language environments support separate 
compilation of source code files into object code files. When a first source code file 
references an object defined in a second source code file, the compiler typically creates an 

15 "external reference" to that object in the object code that it generates when it compiles the 
first source code file. The external reference within the object code is eventually resolved 
into an actual address. 

A number of approaches have been developed for resolving external references. For 
example, with a "static linking" approach, extemal references are resolved at link time, i,e, 

20 when the separately compiled object code files are combined together to form a complete, 

executable program. Another approach is "dynamic linking," in which some object code files 
are grouped into modules typically called "dynamic link libraries" or "DLLs." When a 
program linked with DLLs is executed, the DLLs are not combined with the program until 
the program is loaded into memory for execution. In the dynamic linking approach, extemal 

25 references are resolved at load time. 
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One popular programming language that supports separate compilation is JAVA, a 
platform-independent, object-oriented language developed by Sun Microsystems. In JAVA, 
the attributes and methods for a class of objects are typically defined in its own source file, 
which is separately compiled into an architecture-neutral object file containing bytecodes that 
5 are interpreted by the target platform. Since it is common for one class to reference objects 
that belong to another class, JAVA source files will commonly require external references to 
other objects and their classes. 

Lately, there has been much interest in using JAVA as a development language for 
applications that interact with a relational database system. A relational database system 

10 stores information in tables organized into rows and columns. To retrieve the information, a 
database application submits queries and other "calls" to a database server process, which 
determines which information in the tables satisfies the request. 

A multi-user database allows multiple users to connect to the database concurrently in 
many separate sessions, and typically only a few of the sessions are actively using the 

15 database at any particular time by making calls. When designing a run-time environment for 
database applications, scalability in terms of the number of simultaneous users who can 
establish separate sessions is very important. A significant constraint for user scalability is 
the size of the memory "footprint" that each session consumes, and it is desirable to reduce 
the session memory footprint to improve scalability. 

20 JAVA defines the lifetime of many objects, especially system objects, to extend 

throughout the duration of a session. Some of these objects, however, are infi-equently used. 
Resolving extemal references to such objects at link time or load time according to 
conventional approaches entails loading these objects when the run-time environment is 
executed, either because they are statically part of the executable image or are dynamically 

25 loaded from a DLL. These objects consume session memory and reduce the user scalability 
of the database system. 
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Since session memory is allocated for each active client session, user scalability is 
improved by reducing the memory requirements of session memory. Memory requirements 
for session memory may be reduced by deferring the allocation of external objects in session 
memory until those external objects are actually accessed. This is referred to as lazy 
evaluation for resolving external objects. Li particular, these external objects (whose 
allocation in session memory is designated to be deferred until they are accessed) are so 
designated by creating extemal references to them, for example by implementers of the run- 
time environment, hi addition, if the extemal objects are easily "recreatable," the objects can 
be deallocated and later allocated (or found in a shared memory) through the lazy evaluation 
process. 

One example of an extemal object v^hose allocation would be desirable to defer is a 
class definition for a required, but rarely used system class, such as a system error class. This 
class typically requires much memory to be allocated for static member variables, for 
example, for error messages that are output as well as for the byte codes of the error handling 
instructions. Therefore, allocating a system error class in session memory increases the 
session memory footprint and thereby reduces the number of users who can establish a 
session at one time, even though that allocated session memory is rarely used. Accordingly, 
the allocation of the class definition for such a class is deferred until the class is actually 
used. Furthermore, even when the class definition is allocated upon access, the class 
definition can be allocated in non-session memory. 

Objects stored in longer-duration memory may themselves include numerous 
recreatable external references. For example, the code for a particular class may use 
recreatable extemal references to identify other classes, such as the superclass of the 
particular class. In addition, objects in longer-duration memory may use recreatable extemal 
references to refer to objects that reside in a constant pool. 

A constant pool is an area of memory that holds the data for all of the literal values 

used in a program's code. Such data may include, for example, text strings and numeric 
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constants. Typically, literals are handled by creating a table of pointers, where each of the 
pointers points to a literal value. When code that references a literal value is compiled, the 
literal value is assigned to a global variable, and the code that accesses the literal value does 
so through a reference to the global variable. To avoid loading a potentially large amount of 
5 data associated with such global variables, they may be accessed using recreatable external 
references. This is also used to maintain semantics if objects from different class files might 
be equivalent. 

In addition to recreatable extemal references associated with global variable that 
correspond to literals in a constant pool, longer-duration memory may also contain 

10 recreatable extemal references to other types of objects. For example, longer-duration 

memory may contain recreatable extemal references to methods identified in a method table 
within longer-duration memory. The method table may be accessed, for example, when a 
method in a particular class makes a call to another method in the same class. 

When recreatable extemal references to objects in shorter-duration memory are used 

1 5 in longer-duration memory, it resuhs in a significant performance penalty because the objects 
in the longer-duration memory do not contain information specific to any particular call. 
However, because the recreated objects are reaUzed on a per-call basis, the cached values 
array resides in the call memory, which is private. Consequently, recreatable extemal 
references in longer-duration memory do not contain pointers to the corresponding cached 

20 values arrays. However, the cached values array that corresponds to an recreatable extemal 
reference must be inspected every time the recreatable extemal reference is dereferenced in 
order to determine whether the referenced object has already been realized, and if so, where 
the realized instance of the object is located. Thus, every time a recreatable extemal 
reference in longer-duration memory is dereferenced, relatively expensive measures must be 

25 taken to locate the appropriate private cached values array to determine whether the 
referenced object has already been realized. 
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Unfortunately, in systems that use lazy evaluation for resolving recreatable external 
references, the runtime performance penalty for dereferencing such external references may 
be significant. Based on the foregoing, it is desirable to provide a system that allows lazy 
evaluation of external references, but reduces the runtime performance penalty. 
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SUMMARY OF THE INVENTION 

Techniques are provided for accessing an instance of a recreatable object in a shorter- 
duration memory based on a reference located in a longer-duration memory, where the 
shorter-duration memory is associated with a call. One technique involves (1) locating, 
5 within the shorter-duration memory, a context structure associated with the call; (2) locating 
an cached values array based on data cached within the context structure; (3) determining 
whether the cached values array includes a pointer associated with the reference; and (4) if 
the cached values array includes a pointer associated with the reference, then following the 
pointer to locate the instance within the shorter-duration memory. Another technique, which 
10 may be employed in addition to or instead of the first technique, involves: (1) when a class is 
activated, generating, within the shorter-duration memory, a class object associated with the 
class; (2) storing, within the class object, data for locating instances of recreatable objects 
associated with the class; (3) to dereference the reference, performing the steps of (a) 
determining that the reference is associated with the class; and (b) using the data within the 



15 class object to locate the instance of the recreatable object. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example, and not by way of limitation, 
in the figures of the accompanying drawings and in which like reference numerals refer to 
similar elements and in which: 
5 FIG. 1 is a block diagram of a computer system upon which an embodiment of the 

invention may be implemented. 

FIG. 2 is a schematic drawing of database server memory model for use with an 
embodiment. 

FIG. 3 is a flowchart for resolving recreatable extemal references. 
10 FIG. 4 is a schematic drawing of a layout of an extemal reference in accordance with 

an embodiment. 

FIG. 5 is a flowchart for generating an extemal reference from a pointer to a stub 

object. 

FIG. 6 is flowchart for dereferencing an extemal reference. 

15 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Techniques for dereferencing XREFs that reside in longer-duration memory are 
described. In the following description, for the purposes of explanation, numerous specific 
details are set forth in order to provide a thorough understanding of the present invention. It 
5 will be apparent, however, to one skilled in the art that the present invention may be practiced 
without these specific details. In other instances, well-known structures and devices are 
shown in block diagram form in order to avoid unnecessarily obscuring the present invention. 



10 environment for a language such as JAVA is used. In the illustrated configuration, client 
processes 252, 254, 256, and 258 estabUsh database sessions with the database system 200. 
A database session refers to the establishment of a connection between a client and a database 
system through which a series a calls may be made. As long as the client remains connected 
in the database session, the client and the associated database session are referred to as being 

15 active. Active clients can submit calls to the database system 200 to request the database 
system 200 to perform tasks. One example of a call is a query in accordance with the 
Structured Query Language (SQL), and another example is a method invocation of a JAVA 
object or class, defined for performing a database task for database system 200. 



20 for storing information useful for processing calls and a number of server processes 213 and 
217 for handling individual calls. The database memory 202 includes various memory areas 
used to store data used by server processes 213 and 217. These memory areas include a 
database instance memory 220, session memories 222, 224, 226, and 228, and call memories 
223 and 227. It is to be understood that the number of the session memories and call 

25 memories in FIG. 2 is merely illustrative and, in fact, the number of such memories will vary 
over time as various clients make various calls to the database system 200. 



MEMORY Model 



FIG. 2 schematically illustrates a database system 200 with which a run-time 



Database system 200 comprises, among other components, a database memory 202 
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The database instance memory 220 is a shared memory area for storing data that is 
shared concurrently by more than one process. For example, longer-duration memory area 
may be used store the read-only data and instructions {e.g. bytecodes of JAVA classes) that 
are executed by the server processes 213 and 217. The database instance memory 220 is 
5 typically allocated and initialized at boot time of the database system 200, before clients 
connect to the database system 200. 

When a database session is created, an area of the database memory 202 is allocated 
to store information for the database session. As illustrated in FIG. 2, session memories 222, 
224, 226, and 228 have been allocated for clients 252, 254, 256, and 258, respectively, for 

10 each of which a separate database session has been created. Session memories 222, 224, 226, 
and 228 are a shared memory used to store static data, i.e., data associated with a user that is 
preserved for the duration of a series of calls, especially between calls issued by a cUent 
during a single database session. JAVA class variables are one example of such static data. 
A call memory, such as call memory 227, is used to store data that is bounded by the 

15 lifetime of a call. When client 258 submits a call to the database system 200, one of server 
processes 213 or 217 is assigned to process the call. For the duration of the call, the server 
process is allocated a call memory for storing data and other information for use in 
processing the call. For example, server process 217 uses call memory 227 and session 
memory 228 for processing a call submitted by client process 258, Because the lifetime of 

20 objects in a call memory associated with a call is shorter than the lifetime of objects stored in 
the session memory associated with the session in which the call was made, call memory is 
said to be a "shorter-duration" memory relative to the session memory. Conversely, session 
memory is "longer-duration" memory relative to call memory. 

At any given time, a server process is assigned to process a call submitted by a single 

25 client. After the server process completes its processing of a call from one client, the server 

process is free to be assigned to respond to the call of another client. Thus, over a period of 

time, a server process may be assigned to process calls from multiple clients, and a client may 
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use multiple server processes to handle its various calls. At any given time, the number of 
calls requiring execution by a server process is typically much fewer than the current number 
of active clients. Thus, database system 200 is typically configured to execute fewer server 
processes than the maximum nxmiber of active clients. 
5 "Virtual memory" refers to memory addressable by a storage allocation technique in 

which auxiliary storage, such as storage on a disk drive, can be addressed as though it were 
part of the main memory of a machine. More specifically, combinations of hardware, 
firmware, and operating system cooperate to automatically swap portions of the code and 
data for an executing process on an as-needed basis. Thus, the virtual address space may be 
10 regarded as addressable main memory to a process executing on a computer system that maps 
virtual addresses into real addresses. The size of the virtual address space is usually limited 
by the size of a native machine pointer, but not by the actual number of storage elements in 
main memory. 

"Secondary storage" as used herein refers to storage elements, other than virtual 
15 memory, accessible to a process. Secondary storage may be local or networked. Local 

secondary storage is frequently a random access storage device such as a magnetic or optical 
disk. Networked secondary storage is provided by storage devices on other computer 
systems. 

EXTERNAL OBJECTS 

20 FIG. 3 depicts a flowchart illustrating the memory management involving the lazy 

evaluation of external references. At step 300, a stub object for the external object is created 
in session memory. The stub object includes a "descriptor," which is a group of data that 
identifies the external object and is sufficient to create or recreate the external object. 
Typically, the descriptor is a string that names the object or includes directions for loading 

25 and initializing the external object, for example, from an identified file in secondary storage. 
Furthermore, the stub object is associated with a "realize routine," which can be a static 
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member function or a generic function for the stub object's class. The reaUze routine is 
responsible for creating or recreating the extemal object using the information encoded in the 
descriptor. The implementation of the realize routine and descriptor is dependent on the 
particular extemal object and generally is written specifically for that extemal object or for 
5 extemal objects belonging to a particular class. 

The stub object may be marked "activatable", for example by setting an appropriate 
bit in the object header, to distinguish the stub object from other objects whose loading into 
session memory is not to be deferred. Thus, an activatable stub object is a proxy for another 
object that will be recreated upon demand using information stored in the activatable stub 
10 object. 

RuN-TiME External References 
At step 302 in FIG. 3, when a pointer to the stub object is assigned to a slot in an 
object, for example during initialization of the object, a run-time extemal reference (XREF) 
is generated based on the pointer to the stub object and stored in the slot instead of the pointer 

15 to the stub object. A run-time extemal reference is a reference to an object, such that 
dereferencing the run- time extemal reference causes the extemal object to be loaded into 
virtual memory, if the object is not currently loaded. 

FIG. 4 illustrates a possible layout for a run-time extemal reference 400. In order for 
run-time extemal references to coexist with other types of references, such as machine 

20 pointers and numeric references to loaded objects, a run-time extemal reference preferably 
includes a tag 402 that indicates that the reference is a run-time external reference. Non- 
extemal references may also be tagged, but with a different value to distinguish themselves 
from mn-time extemal references and to indicate other storage properties of the objects they 
reference. In implementations that do not employ reference tagging, this information can be 

25 stored in the header or body of the object, for example by deriving all extemal objects from a 
base class for recreatable objects. 
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In one implementation of reference tagging, a certain number of bits in a reference, 
for example the higher-order bits or lower-order bits, is reserved for distinguishing the run- 
time extemal references from other references. The information embedded within the 
reference, which is likely to be sitting in a fast-access machine register, can therefore be 
5 retrieved very quickly, without requiring additional memory cycles to fetch the header of the 
referenced object. 

A preferred implementation of reference tagging introduces an alignment invariant 
and then exploits the alignment invariant in a run-time environment to encode the run-time 
extemal reference format in the lower-order bits. Specifically, objects managed by the run- 

10 time environment are stored at an N-bit aligned address, or, in other words, the storage for 
these objects begins at virtual addresses at 2^-byte boundaries. For example, if the objects 
can be stored at three-bit aUgned addresses, that is, on 2^=8 byte boundaries, a legal start 
address for such an object might be 0x20446740, but an address such as 0x20446743 is not a 
valid start address for the storage of an object. 

1 5 Consequently, the three least significant bits of the reference do not serve to 

differentiate different objects, since only one of the eight values for the three least significant 
bits is a legal address and the remaining seven values do not point to any other object. Given 
this alignment restriction, references that resolve to addresses 0x20446740 through 
0x20446747 effectively refer to the same object. Therefore, any of the N least significant bits 

20 of a reference to an N-bit aligned object can be used as a tag to encode other information, for 
example, whether the referenced object is an extemal object to be loaded upon use. 

A three-bit tag in a thirty-bit reference will theoretically allow for 2^^'^ = 2^^, or about 
half a billion, different mn-time extemal references. Since this number of mn-time extemal 
references is well in excess of what is needed in most, if not all, mn-time environments and 

25 since the number of different three-bit tag values is limited (i.e. to eight), the same three-bit 

tag value for mn-time extemal references is preferably shared with other kinds of extemal 

reference. The tag value used by mn-time extemal references may be shared with indexed 
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references. An indexed reference is a type of reference that evaluates to an array entry of one 
sort or another. 

Accordingly, some of the non-tag bits of an indexed reference specify the array and 
other bits specify an index into the array. In the case of a run-time extemal reference 400 in 
5 FIG. 4, array bits 404 specify the arrays that are used to implement run-time extemal 

references, namely a descriptor array and a XREF pointers array, which functions as a cahced 
values array. The remaining, index bits 406 specify the common index into the descriptor 
array and the XREF pointers array. In one embodiment, the descriptor array is allocated in 
session memory, e.g. as descriptor arrays 232, 236, and 238 in sessions memories 222, 226, 
10 and 228, respectively, while the XREF pointers array can be allocated in non-session 

memory, for example, XREF pointers array 243 in call memory 223 or XREF pointers array 
240 in database instance memory 220. 

Creating Run-Time External References 
FIG. 5 is a flowchart illustrating steps taken in creating a run-time extemal reference 
15 for an extemal object based on a pointer to a stub object for the extemal object in accordance 
with an embodiment. At a step 500, a pointer to stub object is obtained for assignment in a 
slot of an object within the mn-time environment. Typically, this pointer value is obtained 
from a variable passed into a function, retumed from a function or memory allocation 
operator, loaded from a memory address not part of the program state, or produced by 
20 dereferencing a reference to the stub object. 

During the assignment procedure in step 502, the pointer is dereferenced at a zero or 
negative offset to check a bit in the header of the object for determining whether the 
dereferenced object is "activatable." An activatable object is an object that can be allocated 
and initialized upon dereference. If the object is not activatable, then the non-extemal 
25 reference is generated as appropriate for the run-time environment (step 504). For example, 
if the mn-time environment uses tagged machine pointers to reference objects, then an 
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appropriate tagged machine pointer is generated. As another example, if the run-time 
environment uses tagged numeric references to reference objects in session memory, then an 
appropriate numeric reference is generated. This reference is stored in the slot of the object. 
If, on the other hand, the object is activatable, the referenced object is a stub object 
5 and therefore contains a descriptor for activating the object or, in some embodiments, a 
pointer to the descriptor. Since multiple external references may be generated for the same 
external object, the descriptor, which uniquely identifies the external object, is used to 
perform a look up in a data structure, such as a hash table, to determine whether the external 
object was already referenced (step 506). If a hash table entry corresponding to the descriptor 

10 is not found in the hash table, then the descriptor has not yet been assigned an index value. 
Consequently, a new index value is assigned for the descriptor (and therefore for the external 
object), and the index value stored in the appropriate hash table entry of hash table (step 508). 
Any one of a number of well-known collision handling techniques may be used to address the 
situation in which more than one descriptor hashes to the same hash table entry. 

15 On the other hand, if an index entry associated with the descriptor is found in the hash 

table, then the previously assigned index value for the descriptor (and therefore the extemal 
object) is obtained from the hash table. At step 510, the obtained index value, whether newly 
assigned or fetched from the hash table, is stored in the in the index portion 406 of the run- 
time extemal reference 400. 

20 After the run-time extemal reference has been generated during an assignment 

operation, the mn-time extemal reference is stored in the slot of an object, just like any other 
reference in the mn-time environment is stored in the slot. 

Creating and Recreating Activatable Objects 
Referring again to FIG. 3, a mn-time extemal reference is dormant until the run-time 
25 extemal reference is dereferenced during operation of the run-time environment. 

Dereferencing a mn-time extemal reference refers to resolving the reference into a machine 
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pointer and accessing the memory at an address indicated by the machine pointer. When the 
run-time external reference is dereferenced during operation of the run-time environment, the 
memory for the external object referenced by the run-time external reference may be 
allocated and initialized, frequently in non-session memory (step 304). The external 
5 reference is often to shared memory, for example, to a shared part of a Java class. 

FIG. 6 is a flowchart illustrating steps taken in resolving a run-time external reference 
into a machine pointer to a loaded, external reference. At step 600, a run-time external 
reference is obtained, typically by reading the run-time external reference from a slot in an 
object at a given memory address. Assuming the tag value of the reference indicates that the 

10 reference is indeed a run-time external reference, the index portion 406 of the run-time 
external reference 400 is extracted (step 602). 

At step 604, the XREF pointers array, e.g, XREF pointers array 243 for client process 
252, is checked at the array location that corresponds to the extracted index. The XREF 
pointers array 243 stores references to previously loaded or otherwise resolved external 

15 objects. If an external object is not resolved, then the corresponding reference in the XREF 
pointers array 243 is null. Therefore, if the reference in the XREF pointers array 243 is not- 
null, then the extemal object referenced by the run-time external reference is currently 
loaded. Under those circumstances, the reference in the XREF pointers array 243 is resolved 
into a machine pointer value and retumed (step 606). 

20 If, on the other hand, a null reference was found in the XREF pointers array 243, then 

the extemal object needs to be activated. Accordingly, the descriptor for the extemal object 
is fetched from the descriptor array entry located at same index location in the parallel 
descriptor array 232 (step 608). The descriptor array 232 entry for this index was previously 
set when the run-time extemal reference was originally assigned. The realize routine, 

25 associated with the stub object, such as a member function or a generic function, is executed, 
using the fetched descriptor value (step 610). 
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Running the realize routine causes memory for the object to be found or allocated and 
initialized. The implementation of the reaUze routine will vary from object to object, but the 
present invention is not limited to any particular implementation of the realize routine. For 
example, the realize routine may activate the extemal object by loading data in a file from 
5 secondary storage into a data structure for the array or by pointing to a shared object 230 in 
database instance memory 220 that was identified by hashing the descriptor. 

Since resolving an extemal object referenced by a run- time extemal object is deferred 
until the run-time extemal reference is dereferenced, it is not necessary to allocate memory, 
until the extemal object is actually used, thereby reducing the memory footprint and 

10 increasing scalability. Even when the extemal object is finally loaded, the bulk of the extemal 
object is allocated and recreated as-needed in a non-session memory, such as call memory or 
database instance memory, which is another way of reducing the session memory footprint 
and increasing scalability. Thus, use of mn-time extemal references implements a systematic, 
lazy evaluation for extemal references. 

15 Objects stored in longer-duration memory may themselves include numerous 

recreatable extemal references. For example, the code for a particular class may use 
recreatable extemal references to identify other classes, such as the superclass of the 
particular class. In addition, objects in longer-duration memory may use recreatable extemal 
references to refer to objects that reside in a constant pool. 

20 A constant pool is an area of memory that holds the data for all of the literal values 

used in a program's code. Such data may include, for example, text strings and numeric 
constants. Typically, literals are handled by creating a table of pointers, where each of the 
pointers points to a literal value. When code that references a literal value is compiled, the 
literal value is assigned to a global variable, and the code that accesses the literal value does 

25 so through a reference to the global variable. To avoid loading a potentially large amount of 
data associated with such global variables, they may be accessed using recreatable extemal 
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references. This is also used to maintain semantics if objects from different class files might 
be equivalent. 

In addition to XREFs associated with global variable that correspond to literals in a 
constant pool, longer-duration memory may also contain XREFs to other types of objects. 
5 For example, longer-duration memory may contain XREFs to methods identified in a method 
table within longer-duration memory. The method table may be accessed, for example, when 
a method in a particular class makes a call to another method in the same class. 

When XREFs to objects in shorter-duration memory are used in longer-duration 
memory, it results in a significant performance penalty because the objects in the longer- 

10 duration memory do not contain information specific to any particular call. However, because 
the recreated objects are realized on a per-call basis, the XREF pointers array 243 resides in 
the call memory 223, which is private. Consequently, XREFs in longer-duration memory do 
not contain pointers to the corresponding XREF pointers arrays. However, the XREF 
pointers array that corresponds (t$> an XREF must be inspected every time the XREF is 

15 dereferenced in order to determine whether the referenced object has already been realized, 
and if so, where the realized instance of the object is located. Thus, every time an XREF in 
longer-duration memory is dereferenced, relatively expensive measures must be taken to 
locate the appropriate private XREF pointers array to determine whether the referenced 
object has already been realized. 

20 Unfortunately, in systems that use lazy evaluation for resolving recreatable extemal 

references, the runtime performance penalty for dereferencing such extemal references may 
be significant. Based on the foregoing, it is desirable to provide a system that allows lazy 
evaluation of extemal references, but reduces the runtime performance penalty. 



of the activatable objects. Because the recreated objects are realized on a per-call basis, the 



LOCATING THE XREF POINTERS ARRAY USING A HASH TABLE 



25 



As described above, the XREF pointers array 243 includes pointers to the realization 
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XREF pointers array 243 resides in the call memory 223. At the end of a call, the activatable 
objects are deallocated, as is the XREF pointers array 243 and all other structures within the 
call memory 223 associated with the call. 

The appropriate XREF pointers array 243 must be consulted when an XREF is 
5 dereferenced in order to determine whether the corresponding object has been activated, and 
if so, where the activated instance of the object resides (step 604 in FIG. 6). When the XREF 
resides in longer-duration memory and the corresponding object resides in shorter-duration 
memory, a pointer to the appropriate XREF pointers array will not reside with the XREF in 
longer-duration memory. According to one embodiment, the longer-duration memory is 

10 paged, and the XREF pointers array that is associated with a particular page of longer- 
duration memory is located using a hash table constructed in the call memory. 

Specifically, a hash table that maps shared-memory pages with XREF pointers arrays 
is constructed in call memory. The hash code used to hash into the hash table is generated 
from the address of the corresponding shared-memory page. According to one embodiment, 

15 a hash code is generated for each shared-memory page by performing a cyclical redundancy 
check on the address of the shared-memory page. To avoid recalculating the hash code every 
time that it is needed, the hash code is preferably calculated once, and then stored at a known 
location within the shared-memory page to which it corresponds. Thus, given the address of 
any object in the shared-memory page, the hash code can be quickly determined by 

20 performing a mask operation on the address to locate the start of the page, and adding to the 
page address an offset that produces the address of the hash code stored within the page. 

Once the hash code of the page is located, the hash code (or some subset thereof) is 
used as the hash index into the hash table. A function call is then made to locate a hash table 
entry corresponding to the hash index. If the hash table contains no entry for the page, then 

25 an XREF pointers array has not yet been constructed for the shared-memory page. Under 

those conditions, (1) an XREF pointers array is constructed, (2) a hash table entry that 

associates the hash index with the new XREF pointers array is added to the hash table, (3) the 
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object associated with the XREF is activated based on the corresponding descriptor array 
entry, and (4) an entry is added to the new XREF pointers array to point to the newly 
activated object. 

If the hash table contains an entry, then the XREF pointers array associated with the 
entry is located based on a pointer within the hash table entry. The XREF pointers array is 
searched to determine whether the object associated with the XREF has been activated. If the 
object associated with the XREF has not been activated, then (1) the object associated with 
the XREF is activated based on the corresponding descriptor array entry, and (2) an entry is 
added to the new XREF pointers array to point to the newly activated object. 

If the object associated with the XREF has already been activated, then a pointer in 
the appropriate entry of the XREF pointers array is used to locate the activated instance of the 
object. 

LOCATING THE XREF POINTERS ARRAY USING POINTERS CACHED 

IN THE CONTEXT 

While the technique described above allows the XREF pointers array that is 
associated with a shared-memory page to be located, executing the function call to perform 
the hash lookup for every dereferencing operation imposes significant overhead. According 
to one embodiment, pointers to XREF pointers arrays are cached in a context structure, where 
the context structure is passed as an argument when a call is made. 

Specifically, when a call is made, an array is allocated in the context structure that is 
passed as a parameter to the call. Initially, the array will not contain any populated entries. 
When an XREF is encountered on a shared-memory page, the hash code of the page is 
located and the hash code (or some subset thereof), is extracted. However, rather than 
making a function call to locate a hash table entry, the portion of the hash code is used as an 
index into the array within the context structure. According to one embodiment, the size of 
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the array is 2^, and the number of bits of the hash code that are used to index into the array is 
N. Thus, every possible index value will have a corresponding a slot within the array. 

If the array contains no entry at the indexed location, then an XREF pointers array has 
not yet been constructed for the shared-memory page during that call. Under those 
5 conditions, (1) an XREF pointers array is constructed, (2) an array entry that points to the 
new XREF pointers array is added to the array, (3) the object associated with the XREF is 
activated based on the corresponding descriptor array entry, and (4) an entry is added to the 
new XREF pointers array to point to the newly activated object. 

If the array contains an entry at the indexed location, then it is determined whether the 

10 entry is for the correct shared-memory page. This step is performed because the array may be 
relatively small, and collisions may occur (i.e. different shared-memory pages may map to the 
same index value). According to one embodiment, the key of a shared-memory page is 
stored within the entry for the shared memory page. Collisions may be identified by 
comparing the key stored in an entry with the key of the shared-memory page that is the 

15 subject of the search. In the case of a collision, any one of a number of well-known collision 
handling techniques may be employed. The present invention is not limited to any particular 
collision handling technique. In general, upon identifying a collision, a search for the correct 
entry will continue until either the correct entry is found, or it is determined that the array 
does not contain an entry for the correct shared-memory page. If the array does not contain 

20 an entry for the correct shared-memory page, then the process described above is followed. 

If a correct entry is identified, then the XREF pointers array associated with the entry 
is located based on a pointer within the array entry. The XREF pointers array is searched to 
determine whether the object associated with the XREF has been activated. If the object 
associated with the XREF has not been activated, then (1) the object associated with the 

25 XREF is activated based on the corresponding descriptor array entry, and (2) an entry is 
added to the new XREF pointers array to point to the newly activated object. 
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If the object associated with the XREF has already been activated, then a pointer in 
the appropriate entry of the XREF pointers array is used to located the activated instance of 
the object. 

Accessing the array in the context structure is much cheaper than using the hash table 
5 because accessing the hash table requires a function call, while indexing into the array does 
not involve making a function call and does not incur the overhead associated with 
maintaining a hash table (e.g. collision-handling). 

LOCATING THE XREF POINTERS ARRAY USING POINTERS CACHED 

IN PRIVATE CLASS OBJECTS 
10 For reference patterns where the number of shared-memory pages being accessed is 

small and stable, using a shared-memory-page-to-XREF-pointers-array array within the 
context structure of a call works satisfactorily. However, another type of cache yields 
superior results when the XREFs are to constant pool variable values. 

Specifically, the code referencing the constant pool maintains a "session-private" 
15 version of each class (referred to herein as a "jom_active_class object") in addition to the 
shared version of that class (referred to herein as the "jom_class"). According to one 
embodiment, the session-private recreatable extemal reference state for an object memory is 
stored in the jom_active_class for that object memory. Because there is only one class in 
each shared object memory that holds class information, the association between classes and 
20 the object memories that contain them is 1-1 . Thus, when an XREF encoimtered in an object 
memory stored in longer-duration memory is a reference to a constant pool item, the 
jom_active_class object associated with the class in that object memory is inspected. 

If the jom_active_class object contains no pointer to an XREF pointers array, then an 
XREF pointers array has not yet been constructed for the corresponding jom_class during that 
25 call. Under those conditions, (1) an XREF pointers array is constructed, (2) a pointer that 
points to the new XREF pointers array is added to the jom_active_class object, (3) the object 
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associated with the XREF is activated based on the corresponding descriptor array entry, and 
(4) an entry is added to the new XREF pointers array to point to the newly activated object. 

If the jom_active_class object contains a pointer to the XREF pointers array, then the 
XREF pointers array associated with the class is located based on the pointer. The XREF 
5 pointers array is searched to determine whether the object associated with the XREF has been 
activated. If the object associated with the XREF has not been activated, then (1) the object 
associated with the XREF is activated based on the corresponding descriptor array entry, and 
(2) an entry is added to the new XREF pointers array to point to the newly activated object. 

If the object associated with the XREF has already been activated, then a pointer in 
10 the appropriate entry of the XREF pointers array is used to located the activated instance of 
the object. 

iJl In an alternative embodiment, the XREF pointers array for a class is automatically 

built in call memory at the time that the class is activated. Thus, even the first dereferencing 
operation will encounter a pointer, within the jom_active_class object, to an XREF pointers 
15 array. 



yj 



HARDWARE OVERVIEW 
Figure 1 is a block diagram that illustrates a computer system 100 upon which an 
embodiment of the invention may be implemented. Computer system 100 includes a bus 102 
or other communication mechanism for communicating information, and a processor 104 
20 coupled with bus 102 for processing information. Computer system 100 also includes a main 
memory 106, such as a random access memory (RAM) or other dynamic storage device, 
coupled to bus 102 for storing information and instructions to be executed by processor 104. 
Main memory 106 also may be used for storing temporary variables or other intermediate 
information during execution of instructions to be executed by processor 104. Computer 
25 system 100 fiirther includes a read only memory (ROM) 108 or other static storage device 
coupled to bus 102 for storing static information and instructions for processor 104. A 
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Storage device 110, such as a magnetic disk or optical disk, is provided and coupled to bus 
102 for storing information and instractions. 

Computer system 100 may be coupled via bus 102 to a display 112, such as a cathode 
ray tube (CRT), for displaying information to a computer user. An input device 114, 
5 including alphanumeric and other keys, is coupled to bus 102 for communicating inforaiation 
and command selections to processor 104. Another type of user input device is cursor 
control 116, such as a mouse, a trackball, or cursor direction keys for communicating 
direction information and command selections to processor 104 and for controlling cursor 
movement on display 112. This input device typically has two degrees of freedom in two 
10 axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify 
positions in a plane. 

The invention is related to the use of computer system 100 for implementing the 
techniques described herein. According to one embodiment of the invention, those 
techniques are implemented by computer system 100 in response to processor 104 executing 

15 one or more sequences of one or more instructions contained in main memory 106. Such 
instructions may be read into main memory 106 fi-om another computer-readable medium, 
such as storage device 110. Execution of the sequences of instructions contained in main 
memory 106 causes processor 104 to perform the process steps described herein. In 
alternative embodiments, hard-wired circuitry may be used in place of or in combination with 

20 software instructions to implement the invention. Thus, embodiments of the invention are 
not limited to any specific combination of hardware circuitry and software. 

The term "computer-readable medium" as used herein refers to any medium that 
participates in providing instructions to processor 104 for execution. Such a medium may 
take many forms, including but not limited to, non-volatile media, volatile media, and 

25 transmission media. Non-volatile media includes, for example, optical or magnetic disks, 

such as storage device 110. Volatile media includes dynamic memory, such as main memory 

106. Transmission media includes coaxial cables, copper wire and fiber optics, including the 

-25- 

50277-0459 




wires that comprise bus 102. Transmission media can also take the form of acoustic or Ught 
waves, such as those generated during radio-wave and infra-red data communications. 

Common forms of computer-readable media include, for example, a floppy disk, a 
flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-ROM, any other 
5 optical medium, punchcards, papertape, any other physical medium with patterns of holes, a 
RAM, a PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a 
carrier wave as described hereinafter, or any other medium from which a computer can read. 

Various forms of computer readable media may be involved in carrying one or more 
sequences of one or more instructions to processor 104 for execution. For example, the 
10 instructions may initially be carried on a magnetic disk of a remote computer. The remote 
computer can load the instructions into its dynamic memory and send the instructions over a 

Q 

kil telephone line using a modem. A modem local to computer system 100 can receive the data 

01 on the telephone line and use an infra-red transmitter to convert the data to an infra-red 

Oi signal. An infra-red detector can receive the data carried in the infra-red signal and 

Iji 15 appropriate circuitry can place the data on bus 102. Bus 102 carries the data to main memory 

106, from which processor 104 retrieves and executes the instructions. The instractions 
L . received by main memory 1 06 may optionally be stored on storage device 110 either before 

^; or after execution by processor 104. 

^" Computer system 100 also includes a communication interface 118 coupled to bus 

20 102. Communication interface 118 provides a two-way data communication coupling to a 
network link 120 that is connected to a local network 122. For example, communication 
interface 118 may be an integrated services digital network (ISDN) card or a modem to 
provide a data communication connection to a corresponding type of telephone line. As 
another example, communication interface 118 may be a local area network (LAN) card to 
25 provide a data communication connection to a compatible LAN. Wireless links may also be 
implemented. In any such implementation, communication interface 118 sends and receives 
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electrical, electromagnetic or optical signals that carry digital data streams representing 

various types of information. 

Network link 120 typically provides data communication through one or more 

networks to other data devices. For example, network link 120 may provide a connection 
5 through local network 122 to a host computer 124 or to data equipment operated by an 

Internet Service Provider (ISP) 126. ISP 126 in turn provides data communication services 

through the world wide packet data communication network now commonly referred to as 

the "Intemet" 128. Local network 122 and Intemet 128 both use electrical, electromagnetic 

or optical signals that carry digital data streams. The signals through the various networks 
10 and the signals on network link 120 and through communication interface 118, which carry 

the digital data to and from computer system 100, are exemplary forms of carrier waves 

transporting the information. 

Computer system 100 can send messages and receive data, including program code, 

through the network(s), network link 120 and communication interface 118. In the Intemet 
15 example, a server 130 might transmit a requested code for an application program through 

Intemet 128, ISP 126, local network 122 and communication interface 118. In accordance 

with the invention, one such downloaded application implements the techniques described 

herein. 



20 storage device 1 10, or other non- volatile storage for later execution. In this manner, 
computer system 100 may obtain application code in the form of a carrier wave. 

In the foregoing specification, the invention has been described with reference to 
specific embodiments thereof It will, however, be evident that various modifications and 
changes may be made thereto without departing from the broader spirit and scope of the 

25 invention. The specification and drawings are, accordingly, to be regarded in an illustrative 
rather than a restrictive sense. 



The received code may be executed by processor 104 as it is received, and/or stored in 
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